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ASYMPTOTIC DATA ANALYSIS ON MANIFOLDS 

By Harrie Hendriks and Zinoviy Landsman 

Radboud University Nijmegen and University of Haifa 

Given an m-dimensional compact submanifold M of Euclidean 
space R", the concept of mean location of a distribution, related to 
mean or expected vector, is generalized to more general Revalued 
functionals including median location, which is derived from the spa- 
tial median. The asymptotic statistical inference for general function- 
als of distributions on such submanifolds is elaborated. Convergence 
properties are studied in relation to the behavior of the underlying 
distributions with respect to the cutlocus. An application is given in 
the context of independent, but not identically distributed, samples, 
in particular, to a multisample setup. 

1. Introduction. Data belonging to some m-dimensional compact sub- 
manifold M of Euclidean space R s appear in many areas of natural science. 
Directional statistics, image analysis, vector cardiography in medicine, ori- 
entational statistics, plate tectonics, astronomy and shape analysis comprise 
a (by no means exhaustive) list of examples. Research in the statistical anal- 
ysis of such data is well documented in the pioneering book by Mardia [12] 
and more recently in [13] . Note that in these books, as well as in many re- 
search papers, the primary emphasis is placed on the analysis of data on 
a circle or a sphere. These are the simplest examples of compact manifolds 
and do not manifest the generic features of statistical inference intrinsic to 
compact submanifolds of Euclidean spaces. 

Let V be a family of probability measures on a manifold M C R s and let 

T : V -> R s 

be some s-dimensional functional. The expectation vector 
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is one of the most popular examples of such a functional. Another example, 
more important in the context of robustness, is the spatial median (see [4]) 

?2(P) = arginf / ||x — a\\ dP(x). 

Both of these functionals are special cases of the Frechet functional 
T Fr (P) = arginf f p(x,af dP(x), 

aGR s JR. 8 

where p is some metric in R s and j3 is some positive number (see details 
in [2]). Of course, Huber's M-functionals, as well as many others, can be 
considered. 

One would like to make statistical inference for data on the manifold, but 
in general, T{P) does not lie on the manifold. This is why we consider the 
"orthogonal" projection, or nearest-point mapping, 

7r : R s — > M, ir(x) = arginf ||m — x\\ 2 , 

toGM 

as the instrument for getting characteristics of the distribution P to ap- 
pear in the manifold. Unfortunately, the projection it is well defined and 
differentiable everywhere on R s , except on the set 

C = {x E R s | 7r(x) is not uniquely defined or the square distance function 

L X ((J>) = II/-* — x\\ 2 on M has a degenerate second derivative at fi = tt(x)}, 

which is called the cutlocus. For the sphere 5 S_1 , C consists only of the 
center, but for other manifolds, it may be more complicated (see, e.g., Sec- 
tion 6.3). 

Let X\ , . . . , X n be a sample of size n from the distribution P on the man- 
ifold M and let P n denote the empirical distribution. Then t n = T{P n ) is 
the empirical analogue of T(P) in R s and 7r(T(P n )) is the empirical ana- 
logue of 7r(T(P)) located on the manifold. In case T(P) = T\(P) = EX, one 
has T 1 (P n )=X n = with tt(Ti(P)) and n(Ti(P n )) being the 

mean location and sample mean location on the manifold, respectively. The 
asymptotic statistical inference for this functional is considered in [6, 7]. The 
concept of mean direction coincides with our concept of mean location when 
the manifold in question is the unit sphere. In [8] and [1], this situation is 
studied without any symmetry condition on the probability distributions. 
The present article deals with arbitrary compact submanifolds of R s . This 
may seem restrictive, but any compact manifold can be embedded in R s 
for some s. For example, submanifolds of projective space RP k can be em- 
bedded in Euclidean space using Veronese embedding (see [2]). Beran and 
Fisher [1] also consider the concept of mean axis, which would be within 
the realm of our approach, given such an embedding of the projective space 
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of dimension 2 into Euclidean space. In [2], the consistency of sample mean 
location as an estimator of mean location is investigated in the more general 
context of intrinsic and extrinsic means. 

For the case of T(P) = T2(P), that is, the spatial median, ir(T2(P)) and 
7r(T2(P n )) can be considered as median location and sample median loca- 
tion on the manifold M in the sense of Ducharme and Milasevic [4], who 
considered these concepts and developed some asymptotics for the case of a 
sphere. 

In this paper, we propose a general approach which allows one to study the 
asymptotic statistical inference for both mean location and median location 
functionals, together with many others. The underlying distribution P is 
allowed to depend on sample size n. Moreover, we do not require observations 
to be identically distributed. This essentially widens the framework of the 
applications, for instance to the multisample setup considered in Section 7.2. 
We do not even require that a sample consist of independent observations. 
Generally speaking, we do not require an underlying sample at all, only a 
sequence of statistics t n satisfying a suitable limit theorem. We found that 
the limit distribution does not need to be multivariate normal, but in our 
analysis, it needs to be spherically symmetric. Finally, one of the main issues 
of the paper is the investigation of the question as to how fast in n the spatial 
functional is allowed to approach the cutlocus if the convergence properties 
are still to hold. We supply an example clarifying the possible speed of 
approach. This will be stated in Section 2 and proved in Sections 4 and 5. 
In our results, we will make use of the idea of stabilization introduced in 
[7]. Section 3 is devoted to geometric properties of the projection mapping 
7r. In Section 6, the general results are illustrated for the sphere. In fact, 
they generalize the results of Hendriks, Landsman and Ruymgaart [8] and 
Ducharme and Milasevic [4]. In this section, the effect of the stabilization 
term is demonstrated. Section 6.3 provides a brief review of the ingredients of 
the main theorems for Stiefel manifolds. Section 7 is devoted to application 
of the main results. 

We will use the following notation: For t £ R s and a closed subset C C R s , 
d(t, C) denotes the minimal Euclidean distance between t and points of C . 
In particular, for C = {x}, we have d(t,C) = d(t,x) = \\t — x\\. The norm 
|| 5 1| of a matrix B will be the standard operator norm of linear transforma- 
tion associated with matrix B; see, for example, [11], Chapter 7, Section 4, 
Equation (2). Given a symmetric positive definite matrix B, its square root 
B 1 ' 2 is the unique symmetric positive definite matrix with the property that 
a sequence of matrices B n , B n — > B denotes convergence 
in operator norm or, equivalently, coefficientwise convergence. The notation 

T> 

Z n ^> Z denotes convergence in distribution of random variables Z n to Z 



4 



H. HENDRIKS AND Z. LANDSMAN 



and X = Y denotes equality in distribution of random variables. The nota- 
P 

tion Z n — ► Z denotes convergence in probability. This is used with Z = 0, in 
which case we may also write Z n = op(l). 

2. Main results. 

2.1. General setup. We consider the situation where a compact m-di- 
mensional submanifold M (without boundary) of R s is given. Let ir : R S \C — 
M be the nearest-point mapping, where C is the cutlocus, as defined in Sec- 
tion 1. Note that the cutlocus is a closed subset of R s . 

Let t n G R s be a sequence of spatial characteristics and t n £ R s be random 
vectors which we consider as estimators of t n , in the sense that 

(2.1) Z n = B~ 1 (i n -t n ) ^> Z asn->oo, 

where Z is some random vector in R s and the B n are nonsingular s x s 
matrices such that B n — > for n — > oo. In particular, it follows from (2.1) 

that \\t n -t n \\ -> 0. Denote /i„ = n[t n ), fi n = Tr(t n ). 

Remark 2.1. A simple situation is that an i.i.d. sample X\, . . . ,X n , 
is given where X\ is distributed with probability measure P n on R s (not 
necessarily related to the manifold M). Associated with the distribution 
P n is some characteristic t n = T(P n ) € R s , and we are interested in the 
"manifold part" /i n = n(t n ) of it. Furthermore one may define t n = T(P n ), 
where P n denotes the empirical distribution. If P n = P, then t n =t,fi n = fi, 
that is, they do not depend on n. This simpler, but important, specialization 
will be considered in the next subsection. 

Theorem 2.1. Suppose t n C and d(t n ,~M) < D for some D > 0. // 

(2.2) B n /d(t n ,C)^0, 
p 

then ||/x n — fi n \\ — > 0. 

Definition 2.1. Recall that a distribution Z is called spherical (see [5]) 
if for any orthogonal matrix H G O(s), HZ = Z. 

The most common example of a spherical distribution is the multivariate 
standard normal distribution. 

Remark 2.2. Note that for spherical Z and any r x s matrices A and B 
such that AA T = BB T , we have the equality AZ = BZ. This follows from 
property that the characteristic function fz(t) of Z is a function of ||f||. 
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Let T^M and iV^M = (T^M) be the tangent and normal spaces of M, 
respectively, at the point [i G M, considered as linear subspaces of R s . Let 
tan At (-) and nor At (-) = (I s — tan^)(-) denote the orthogonal projections onto 
TfjM. and iV^M, respectively. Here, l s denotes the identity mapping of H s . 
The s x s matrix-valued mapping M3/iw tan^ G Mat(s, s) is smooth since 
it can be expressed locally in terms of m smooth, independent tangent vector 
fields along M. Thus, fj, i— > nor„ is also smooth (cf. [9], pages 1-15). 

Remark 2.3. For spherically distributed Z = (Z\, . . . , Z s ) G R s , the dis- 
tribution of Z T tan ll Z = J27Li Zf and consequently does not depend on /x. 
This can be seen as follows. Given ^ G M, there exists an orthogonal ma- 
trix H such that tan^ = H T l s rn H, where I Sjm is a diagonal matrix, the first 
m diagonal elements of which are ones and the others zeros. Because of 
spherical symmetry, we have 

m 

Z T tan^Z = Z T H T I sm H Z = Z T I s ^ m Z = ^ Zf. 

i=i 

We will call its distribution where m = dim(M). Recall that for the 
standard multivariate normal distribution Z, this distribution coincides with 
the Xm"distribution. 

Recall that any normal vector Vn G iV^M determines a linear map, the 
Weingarten mapping ([9], pages 13-15), given by 

(2.3) A^:T ll M^T ll M:A^(w II ) = -t a n fl {D w ^v)), 

where v : M — > lR k is any smooth mapping such that v(a) G for all 

a G M and such that v{fj) = (e.g., v{a) = nor^t^)). D w (■) denotes 
coordinatewise differentiation with respect to the direction u;^ G T^M C R s . 
Both tan^ and the Weingarten mapping A Vfi are self-adjoint with respect 
to the Euclidean inner product and are therefore represented by symmetric 
s x s matrices. 

Let Id^ stand for the identity mapping of T^M. In [6] it was shown that 
the derivative of the projection ir has the form 

(2.4) Tr'(t) = (Id fl -A t _ fl )-him fM , 

where At-^ is the Weingarten mapping corresponding to the normal vector 
t — n and where fi = ir(t). Define 

(2.5) G n = (Id^ - A tn „ / , n )tan Atn + nor M „ = L - A^-^tan^ 

so that in particular, G n 7r'(t n ) = tan^ n . Note that G n is a symmetric matrix. 
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Theorem 2.2. In addition to the assumptions in Theorem 2.1, let T n 
be a sequence of s x s matrices such that 

(2.6) ||r n ||||£ n || 2 Mt„,c) 2 ^o. 

Then 

1. T n G n (ji n - n n ) - (r n tan Mn fi n )Z n 0. 

Furthermore, suppose that the limit distribution Z in {2.1) is spherical and 
let the matrix T n be chosen such that 

(2.7) rntaa^BnBjJtan^r^ = tan Mn . 

2. Suppose that /% — ► \l for some [i G M. Then 

T n G n (fi n - fi n ) = (r n tan Mn B n )Z n + o P (l) -> tan^Z. 

3. Without any restriction on [i n we have 

(2.8) (fl n - fMn) T G n T^T n Gn(fln ~ Mn) ^ Cm- 

Remark 2.4. Note that r n is not uniquely defined by condition (2.7). 
Sometimes it is convenient to choose T n such that it commutes with the pro- 
jection tan^ n as this implies that T n maps tangent vectors to tangent vec- 
tors and normal vectors to normal vectors. For example, T n = (a~ 2 nor Atn + 
tan Atn i? n i?^tan /Jn ) _1 / 2 for some suitable sequence a n - In this vein, T n is 
an invertible mapping, implying that in Theorem 2.2, item 3, G n T n F^G n 
represents a symmetric positive definite matrix. 

With respect to G n and the choice of T n in Remark 2.4, note that adding 
the normal part makes the linear transformations invertible and leads to 
confidence regions which are intersections of an ellipsoid with the manifold. 
Leaving G n and T n degenerate (G n and T n are nondegenerate on T^M) 
does not allow one to control normal directions and leads to a confidence 
region which is the intersection of a cylinder with the manifold, typically 
consisting of several disjoint pieces of the manifold. This adding of the nor- 
mal part we call stabilization. Another important role of stabilization, in the 
two-sample problem, is noted in [7], Remarks 1 and 5. 

Remark 2.5. In an application where G n and r n are not known, we 
suggest replacing them with their values corresponding to the empirical val- 
ues t n ,fin of t n ,/j, n (cf. [7]). In the same vein, instead of the transformations 

B n , some consistent estimator B n of B n , in the sense that B n B n — > I s , 
could be used. 



ASYMPTOTIC DATA ANALYSIS ON MANIFOLDS 



7 



Corollary 2.1. In case B n = a~ l V 1 ^ 2 , where a n is some sequence such 
that a n — > oo and V is a positive definite matrix, condition {2.2) of Theo- 
rem 2.1 simplifies to 

(2.9) a n d(t n ,C) oo. 

Taking T n = a n (nor Mn + tan^ Vtan Mn )~ 1//2 ; condition (2.6) of Theorem 2.2 
simplifies to a n d(t n , C) 2 — > oo. 

The conclusions remain true under the weaker assumption that B n — 
a~ x Vn 2 , where V* < V n < V**,n = 1,2,..., and matrices V n ,V* ,V** are 
positive definite. 

2.2. Underlying probability P does not depend on n. In this section, we 
return to the situation described in Remark 2.1. Suppose that neither the 
probability measure P n on the manifold nor the functional T n depends on 
n, that is, P n = P and T n = T, so t n = T n (P n ) = T(P) = t does not depend 
on n. Then the statements of Theorems 2.1 and 2.2 can be simplified. In 
fact, condition (2.2) is a consequence of the condition t ^ C. In case B n = 
a^V 1 / 2 , where a n is some sequence such that a n — > oo and V is a positive 
definite matrix, T n can be chosen as T n = a n (nor M7i +tan Mn V r tan Atn )~ 1 / 2 and 
condition (2.6) of Theorem 2.2 automatically holds. 

Theorem 2.3. Suppose that t^C and 

(2.10) Z n = a n V~ l l 2 (i n — t) Z as a n — > oo, 
where Z is some random vector in R s . Then: 

1. \\fl n ~ A* || 0. 

Furthermore, suppose that the limit distribution Z in (2.10) is spherical. 
Then 

2. a^nor^ + tan / ,V A tan At )" 1 / 2 G? n (/i n - Li) = 

((nor M + tan /J V A tan M ) _1 / 2 tan At y~ 1 / 2 )Z n + op(l) ^> tan^Z and 

3. a 2 (/z n - /i) T 67 n (nor At + tan At ytan At )~ 1 C7 n (/i n - Li) ^ CL where the 
limit distribution does not depend on /i, that is, is standard (see Remark 
2.3). 

If the covariance of the distribution P exists, and t = T\ (P) is the expected 
vector of P and t n = T\(P n ) is the sample mean vector, then one can choose 
a n = y/n and £ m will be the Xm distribution. In Section 7, we exhibit a case 
with a different choice of a n and C m . 
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3. Geometry. In this section, we collect the necessary results concerning 
the projection mapping ir. 



Lemma 3.1. Lett^C. Then 

Note that the inequality is sharp in the case where M is the sphere S r 
and t lies in its convex hull, the unit ball D m+1 . 



Proof. Consider t ^ C and let A ^ be the largest eigenvalue (in abso- 
lute value) of the symmetric linear transformation vr'(t). Let 7r(t) = [i. Prom 
(2.4) it follows that (A — 1)A~ X is an eigenvalue of A t _^. But the Weingarten 
mapping A t -^ depends linearly on t — fj,, as long as t — \x _L T M M. By look- 
ing at the path t a = a(t — /i) + fj,, with a running from 1 to A(A — 1) , 
we see that the largest eigenvalue of (Id M — A ta -^)~ 1 tan^ runs from A to 
oo. Therefore, if it is not the case that t a £ C for some a strictly be- 
tween 1 and A(A — 1) , then it is so for a = a± = A/(A — 1). Therefore, 
d(t,C) < \\t ai -t\\ = \\(ai - l)(t-fi)\\ = |A- l| _1 d(i,M). From this, it fol- 
lows that |A| < 1 + d(t,M)/d(t,C). □ 

We state one more lemma, giving the differentiability of the tangential 
projections and the Weingarten mapping. 

Lemma 3.2. The mapping M B fx — > tan M is C°° -differentiable in \i. Its 
values are symmetric s x s matrices. The Weingarten mapping 

K s x M B -» A^t^^tan^ 

is C 00 -differentiable on (£,/x). Its dependence on £ for any fixed \i is linear. 
Its values are symmetric s x s matrices. 



Note that the Weingarten mapping A u in some tangent space T^M is 
only defined for v _L T^M. This is the reason why £ appears in the form 
£ — tan^£ = nor^(£) in the above formula. The proof can be based on the 
ideas given in Section 2.1. 

The next lemma concerns the preimages of the mapping ir. It is required 
for the treatment of multisample data. 



Lemma 3.3. Suppose that to,ti ^ C and ir(to) = 7r(ti) = M e M. Let 
a £ [0, 1]. Then t a = (1 — a) to + ati ^ C and 7r(t a ) = fx. In other words, 
7r _1 {/i}\C is convex. 
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Proof. First we show that there exists a unique point on M closest 
to t a and that it is the point \i. Let x G M. A plane geometric calculation 
involving two applications of the cosine rule reveals that 

(3.1) \\t a — x\\ 2 = a\\ti — x\\ 2 + (1 — ck) ||*o — x \\ 2 ~~ \\ta — • \\t a — to\\. 

This would be minimal if both ||*o — ^|| and — x\\ were minimal, but 
this is the case precisely for x = fi. Thus, \\t a — x\\ reaches its minimum at 
the unique point x = \x. We still need to show that the function M3i-t 
Lt a {x) = \\t a — x\\ 2 has a nondegenerate second derivative at //. Equation 
(3.1) states that L ta = (1 — a)L to + aL tl up to a constant term. For a real- 
valued function / on M, let df{x) denote the differential of / at the point 
x. This means that df(x) G T*M is the dual vector, mapping any tangent 
vector v £T X ~M. to the derivative of / in the direction v. For a stationary 
point fi G M, that is, a point fi satisfying df(fj,) = 0, the Hessian H/ is 
defined as a symmetric bilinear form on T^M (see [15], pages 4-5). Since 
dLt a = (1 — a) dLt + adLt 1 at any point x G M, it follows that HLt Q = 
(1 — a)HLt +aHL tl at the stationary point \i. Since HX ta is positive definite 
for a = and a = 1 , it follows that it is positive definite for any < a < 1 . 
Together with the uniqueness of the nearest point, this means that t a G' C. 
□ 

4. Convergence in probability: Proof of Theorem 2.1. 

Proof of Theorem 2.1. First, note that for any differentiable function 
/ (real-, vector- or matrix- valued) , the following formula holds: 

(4.1) f(y) - f(x) = f 1 f'(x + 9(y - x))(y - x) d9. 

Jo 

Applying this formula to the vector-valued function vr(-), we obtain 

An - = / 7r'(*n + 0(*n ~ t n ))(t n - t n ) dd 

(4.2) Jo 

= / TT'(t nd )(t n -t n )d9 

Jo 

with t n g = t n + 8(t n - t n ). 

There now follows an ingenious argument, which simplifies a tedious cal- 
culation to an application of the continuous mapping theorem. Consider the 
event 

(4.3) F n = {d(t n ,t n ) < d(t n ,C)/2}. 

Note that from assumption (2.1), i n — t n = d(t n ,Q)d{t n ,Q)~ l B n Z n , where 

Z n ^ Z, and that because of assumption (2.2), d(t n ,C)~ l B n — > 0, so 
1 P 

d(t n ,C)~ B n Z n — ► and consequently, 

(4.4) P(F n ) > P(\\d(t n , Cy^nZj < 1/2) 1. 
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In the event F n , we have 

d(t n e, C) > d(t n , C) - d(t n , t n o) 
>d(t n ,C)-d(t n ,t n ) 
> d(t n , C) - d(t n , C)/2 > d{t n , C)/2. 

In particular, t n g ^ C and from Lemma 3.1, 



rf(t wg ,M) 
F (*ne)||iF„ < -777 — prv + 1 



(4.5) 



d(* n ,M) + rf(t w ,C)/2 
d(t„, C)/2 

<2^ M) +2. 



Lemma 4.1. Suppose P(i ? n ) - * 1- T/ien the following holds. Iflp n X n —* U , 

D P P 

then X n —¥ U {special case: if lp n X n — >0, then X n — > 0). 



Proof. \P{X n < u} - P{l Fn X n < u)}\ < P(i^) = 1 - P(F n ) - 0. □ 



sup 



Since from (4.5) we have 

Equation (4.4), together with Lemma 4.1, yields 

[\'{t ne )B n d6 4 0. 
J o 



+ 2 115, 



Moreover, from condition (2.1) we have Z n — > Z and (4.2) can be rewritten 



as 



l 

Hn-Vn=l n'(t n& )B n d6 Z n . 

10 



Hence, by the continuous mapping theorem, 

An — A*n — ► or, equivalently, \\j} n — fJ, n \\ 0. 



Thus, Theorem 2.1 is proved. □ 
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5. Limit law: Proof of Theorem 2.2. Let NM = {(/x, £) G R s x R s | /x G 

M, £ _L T U M} be the normal bundle of M in R s and let G : NM -> Mat(s, s) 
be the s x s matrix- valued mapping defined by Cr(/x,£) = (I s — A^t&n^), 
where denotes the Weingarten mapping (see (2.3)). Thus, G(ir(t),t — 
7r(t)) vr'(t) =tan 7r ( < ). Most importantly, G is a smooth mapping and £^ 
G(/i,£) is an affine mapping for every /x G M (see Lemma 3.2). 

In particular, since M is compact, there exists a constant K such that for 
all (//,£') G iVM, we have the inequality 

(5.i) ||G(/x,a - < k{\\» - A\ + 110 + - 0' + Oil)- 

Note that G n = G(7r(t n ),t n — n(t n )). Also, the mapping M 3 fj, — > tan u is 
smooth (Lemma 3.2) and because of the compactness of M, there exists a 
constant K\ such that for all /x,/x' G M, we have the inequality 



(5.2) 



|tan„ — tarn/ 1| < ifi||/x — /x'||. 



As in the proof of Theorem 2.1, let F n be the event defined in (4.3). Prom 
(4.2) and (4.5), we obtain, for some K2, 



d(p, n , /%)1f„ 



(5.3) 



n'(t n o)l Fn dO (i n -t n ) 

[ n' {t n o)l Fn B n dO Z % 
Jo 



- d{t n ,cy n 



We are going to show that T n G n (fl r 
start from the identity 



T n tann„B n Z n ->• 0. Let us 



(5.4) 



An - A*n - 7T' '{t n )(t n - t n ) = / {tno) - It' {t n )){t n - t n ) dO 



(Tr'(t n e) - it' (t n )) dO B n Z n . 



Then 



^nGnifin ~ l^n) ~ ^n^' !1 iJ, n B n Z n — T n G n (fl n — [l n — Tr'(t n )(t n — t n )) 

G n {7r'{t n e)-ir'{t n ))deB n Z n 



(5.5) 

Let n n e = ir(t n g) and G n g = G(ir(t ne ), t ne - Tr(t n0 )) = G(fi n g,t ne - Uno)- Then 
r n G n (7r / (t ne ) - n'(t n ))B n = T n (G n - G n e)-K'(t ne )B n 



(5.6) 



+ T n (tan 



tan M J.B n . 
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Using (4.5), (5.1), (5.2) and an obvious extension of the upper bound (5.3) 
to d(n n Q,fi n ) (which is applicable since in the event F n , the inequality 
d(t n 0,t n ) < d(t n , C)/2 also holds), and taking into account the fact that 

\\t n 6 — t n \\ = \\0(tn - tn)\\ < ll-^nll ||-Bn ~~ *n ) 1 1 = \\B n \\ \\Z n \\ 

in F n , we obtain the bound 

||(G„ " G ne y(t n g)\\ < 2K(\\^ ne - Mn || + \\t ne - t n \\) d{tn ^ + ^ C) 
(5.7) < 2^^1111^11 + ^,M) + ^,C) 



d(t n ,C)" "7 d(t n ,C) 

<?k(m K2 W |, ^n,M) + d(t n ,C) 

- 2K { 1 + d(t^c)P Bnl1 W^c) 11 nl 

We see that \\T n (G n - G n0 y(t n6 )B n \\ £ if ||r„||||fl n || 2 M* n , C) 2 0. 
Moreover, we have 

K2 \\B II 

(5.8) ||r n (tan Mne - tan^J^H < ||r n ||||B n ||ii'i— ^ — ^||Z n ||, 

so that ||r n (tan Mn£) -tan Mn )5 n || ^ if ||r n || ||B n || 2 /d(t n , C) -> 0. Since the 
t n 's are confined to a finite distance from the compact submanifold M, we 
also have that d(t n ,C) is uniformly bounded and the condition ||r n || ||-B n || 2 / 
d(t n ,C) — > is a consequence of condition (2.6). Under this last condition, 
the right-hand side of (5.6) converges to in event F n and thus the left-hand 
side of (5.5) converges to in probability. This proves item 1 of Theorem 2.2. 
For the proof of the second item, we use the fact that (r n tan Mn i? n ) x 

{T n tB.'a. lln B n ) T = tan Mn and therefore (r n tan Mn i? n ) is uniformly (in n) bounded. 

T> 

Moreover, since Z is a spherical distribution, we have T n tan^ n B n Z = tan^ n Z. 

T> 

Under the condition that fj, n — > fi, we have tan Mn Z — > tan M Z. Then item 2 
of Theorem 2.2 is a simple consequence of the following lemma: 

Lemma 5.1. Let A n (n = 1,2, ...) be linear transformations that are 
uniformly (in n) bounded in norm and let X n and X be random vectors. 

Suppose X n — > X and A n X — > W . Then A n X n — ► W. 

PROOF. Let teR s and let K = sup n We denote the characteris- 

tic function of a random vector Y by fy . Then for large n, \ fA n x n (t) — fA n x (t) \ : 
\fx n (Alt) - fx(A%t)\ < 8ap\\ a \\< K \fx n ( 8 ) ~ fx(s)\ < e, and for large n, 
\fA»x(t) - fw(t)\ < e. So, for large n, we have \fA n x n (t) - fw(t)\ < 2e. This 
proves the lemma. □ 



For the proof of item 3, we need the following: 
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Lemma 5.2. Suppose that X n (n = 1,2,... ) and X are random vectors 

in R s such that X n — > X. Let g be a continuous mapping. Suppose that A n 
(n = 1, 2, . . .) are linear transformations, uniformly (in n) bounded and such 

that g(A n X) — > W for all n. Then we also have g(A n X n ) — > W. 

Proof. First, we consider the case where the sequence A n converges to 
some A. Then the lemma is an easy consequence of the continuous mapping 
theorem. If A n is not convergent, reasoning by contradiction, suppose that 
for some t, the characteristic function of g(A n X n ) in t does not converge to 
fw(t)- Then for some e > 0, one can construct a subsequence rij for which 
\fg(A n .x n .)(t) ~ fw(t)\ > e and from uniform boundedness of A n , there exists 
a subsequence rij. for which A Hi . converges. This leads to a contradiction of 
the first case. The lemma is thus proved. □ 

From condition (2.7), it is clear that T n tan Aln i? n is uniformly bounded and 
according to Remark 2.3, we have r n tan^ n i? n Z = Q m . The lemma then 
yields ||(r n tan^ n £ n )Z n || 2 ^> Q 2 n . Thus ||r n G n (/%-/%)|| 2 = \\(T n t&n fln B n )Z n + 
op(l)|| 2 ^> Cm- Theorem 2.2 is now proved. □ 

6. Spheres and stabilization; Stiefel manifolds. Note that condition (2.6) 
is necessary for Theorem 2.2, even for the simplest case of the sphere. 
The following example shows this in the case of a circle and determinis- 
tic Z n . Recall that in the case of a sphere, M = 5 S_1 = {x G R s | ||x|| = 1}, 
C={0} (the origin), ir(t) = (t C), 7r'(t) = ||t || ^tan,,-^) and G n = 

||t n ||tan MTt + (I s -tan Mn ); see [7, 8]. 

6.1. Example of necessity of condition (2.6). Suppose that M = S 1 C 
R 2 . Let a n ,u n > be such that a n — ► oo and a n u n — > oo and let t n = 
(u n ,0) and i n = (itn^" 1 ), B n = a" 1 be such that condition (2.2) holds. 
Note that Z n = a n (t n - t n ) = (0, 1) = Z. Also, \i n = fj, = (1,0), fi n = (u n + 
a~ 2 )~ l l 2 (u n , a~ l ) and G n = ti n tan^ + (I s — tan M ). Taking T n as in Corol- 
lary 2.1, we have T n = a n . We find that 

r n G n (/i n — fi) = a n ( —— _„ — 1, 



(ui + an 2 ) 1 ' 2 \u 2 +an 2 ) 1 / 2 



Or, 



-2)1/2 y'( u 2 +a -2)l/2 



This should converge to tan^Z = (0,1). The second, tangential coordinate 
does have the correct limit, namely 

Un 1 1 2 

1 = 77~n W9VT72 ~ 1 ~ ~7;{ a nU n ) -^0, 



(ul + an 2 ) 1 / 2 (l + (a n u n )- 2 y/ 2 2' 
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but the first, normal coordinate 

U n A 1 , x_9 



a,, 



converges to only if a n u n — > oo, which corresponds exactly to condition 
(2.6). 



6.2. Relaxation of condition (2.6): Tuning the stabilization. In the above 
example, we have seen that the tangential part of T n G n (fl n — fi) has the 
desired limit behavior. The reason why the normal part does not behave 
appropriately, nevertheless, is the rough stabilization term (I s — tan Mn ) of 
G n . We may modify G n to G n = (ld Un - A t „_^ n )tan /in + e n (I s - tan^J, 
where e n is chosen sufficiently small, in order that condition (2.6) can be 
relaxed in the case of a sphere. It should be noted that the sphere is the 
only case known to us where such an improvement is possible. Even in the 
case of a noncircular ellipse, considered as a submanifold of the plane, with 
the cutlocus corresponding to the line segment connecting the focal points 
(see [6]), condition (2.6) cannot be relaxed by modifications of G n or T n in 
the normal directions. 

Theorem 6.1. Let M = S 8 ~ 1 . The conclusions of Theorem 2.2 hold, 
even when condition (2.6) is relaxed to 

(6.1) ||r n ||||fi„|| 2 Mt„,c)^o, 

if G n is replaced by the operator 

G n = (ld Un - A< n _ Atn )tan Atn +e„(I s - tan M J = \\t n \\tan Un + e n (l s - tan^J, 
where e n = 0(\\t n \\) . In particular, one can take e n = \\t n \\. Then 

(6.2) G n = \\t n \\l s . 

Corollary 6.1. In the case where B n = a^V 1 ^ 2 and T n is as in Corol- 
lary 2.1, that is, T n = a n (nor Un + tan Mn lAan Mn ) -1 / 2 , condition (6.1) coin- 
cides with the first condition (2.9) of Corollary 2.1, namely a n d(t n , C) — > oo. 

The conclusions remain true under the weaker assumption that B n — 
a^-Vn 2 , where V* < V n < V**,n = 1,2,..., and matrices V n ,V*,V** are 
positive definite. 

Corollary 6.2. In the case where the distribution of t n is rotationally 
symmetric about direction [i n , t n can be represented in the form 



t n = [i n u + v£, 
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where £ is uniformly distributed on the equator of the sphere, perpendic- 
ular to p n and independent of random variables u and v. Then if B n = 
V^) 1 / 2 , where V(-) is the covariance matrix <?/(•), it can be represented as 
B n = V^MnMn + \/%(Is ~ where 7 n =V XVnt n ) and f3 n = E(p n || 2 - 

(p^t n ) 2 )/(s — 1) (cf. [20], page 92). Moreover, T n can be chosen as F n = 

finals provided max^nPn 1 / 2 , f3n 2 ) /\\t n \\ —>0. 
This happens if 

1 n 

i n = arginf- Vp(||Xi -t\\), 
ten nf^i 

where p is some loss function and X\, . . . ,X n constitute a random sample 
from a rotationally symmetric distribution about direction p n . Depending on 
the choice of p, this is applicable, for example, to the expected vector and the 
spatial median. 

Proof of Theorem 6.1. For simplicity, we give the proof for e n = 
\\t n \\. Then G n - G n g = (\\t n \\ - ||t„e||)I s and therefore 

(6.3) \\G n — G n o\\ <\\t n — t n o\\. 

If in inequality (5.7), inequality (6.3) is used instead of (5.1), then \\p n g — pn\\ 

disappears and we obtain the improvement ||r n (G n — G n o)-K' {t n e)B n \\ —> 
if ||r n || \\B n \\ 2 / d(t n , C) — > 0. This change in the proof of Theorem 2.2 imme- 
diately leads to a proof of Theorem 6.1. □ 

6.3. Stiefel manifolds. We give a very brief review of the main ingredi- 
ents needed in the application of Theorems 2.1 and 2.2. More details can 
be found in [7]. We consider the Stiefel manifold V p _ r (r <p), understood 
as the submanifold of the vector space ofpxr matrices given by the equa- 
tion p T p = I r . The inner product structure for p x r matrices is given by 
(u,v) = Tt&ce(u T v) = Tr&ce(uv T ). The cutlocus C is the set of all matrices 
having rank less than r. Then for X £ C, that is, rank(X) = r, 

TT(x) = x(x T xy 1 / 2 

and for the matrix p n = ir(t n ) 6 V pr , t n £ C, 

tan Mn pf) = X - ^p n [p T n X + X T p n ] 

and 

1 1 

G n (X) = t&n Un (X)pTt n + -/i n tan Aln (X) T t„ - -t n t&n Un (X) T p n 

+ (X-tan^(X)). 

The following theorem makes explicit the distance between any p x r matrix 
and the cutlocus: 
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Theorem 6.2. Let C be the cutlocus of the Stiefel manifold V v<r . Let t be 
apxr matrix. Then the Euclidean distance oft to C equals d(t, C) = \Amin> 
where A m i n is the smallest eigenvalue oft T t. 

Proof. Note that for any pxr matrix u of rank less than r, there exists 
a unit vector w £ R r such that uw = 0. Given apxr matrix t of rank r, 
v = t — tww T is a rank r — 1 matrix and t — v = tww T is perpendicular to 
u — v. Thus, d(t,v) <d(t,u). Now, d(t,t — tww T ) 2 = \\tww T \\ 2 is minimal if 
w is the eigenvector associated with the smallest eigenvalue A m i n of t T t and 
then d(t, t — tww T ) 2 = X m - m - □ 

In the case of the sphere = V Si \, A m j n = t T t = ||i|| 2 . In the general 
case, a smooth lower bound for d(t, C), which is sharp in the case of the 
sphere, is given by 

d(t,C) 2 >Tr((t T t)^ 1 )- 1 . 

7. Applications. First, we will explain how the results of Hendriks and 
Landsman [7] fit into the approach adopted in this paper. In the aforemen- 
tioned work, the starting point is a probability measure P on a compact 
submanifold M of R s and an i.i.d. sample X\, . . . ,X n from distribution P. 
The investigated functional T is expected value. Corollary 2.1 is applicable, 
where one may take P n = P, P n the empirical distribution of the sample, 
T the expected value functional and, finally, a n = y/n. t n = E(X) = t € R s 
(the Euclidean mean of P) and i n = X n = \ £"=i X { G R s , the sample mean. 

= 7r(t) and /z n = ir(t n ) are the mean location and sample mean location, 
respectively. Of course, the spherical distribution Z is standard multivariate 
normal and the Cm distribution is simply Xm- Note that the approach in 
this paper allows for the making of inference on fj, n , even for a sequence of 
underlying probability measures P n depending on the sample size n [cf. Re- 
mark 2.1], for which the Euclidean means t n may converge to the cutlocus 
with a speed such that ^/nd(t n , C) 2 — > oo [for the case of a sphere, with G n 
as in Theorem 6.1, it is enough that y / nd(t n , C) = v^Pn|| — > °°]- 

7.1. Median location functional. In this subsection, we explain how the 
results in [4] with respect to median direction fit into our approach and can 
be generalized to the situation without the rotational symmetry requirement 
on the distribution of the sample, even to the situation of any compact sub- 
manifold of R s . Even the probability measure which generates the sample of 
size n may depend on n. Let P be a probability measure on a compact sub- 
manifold M of R s . Recall that the spatial median in R s is defined uniquely 
if the probability distribution is not supported by a straight line (see [14]). 
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Let M = S s_1 be the sphere in R s . Then consider Corollary 2.1 with 
a n = y/n, P n = P, P n the empirical distribution of the sample and T the 
spatial median functional, that is, 



(7.1) T{P) = arginf / \\X - a\\ dP. 

a£R s J 

Let t n = 7] = T(P), and let t n = T(P n ) = fj n be the sample spatial median. 
Let [i n = ir(r]) = rj/\\r]\\ = 9 and jl n = vr(rf n ) = 9 n be the median direction 
and sample median direction, respectively. Then our convergence condition 
(2.1) corresponds to [4], condition (3.1), and we immediately obtain the 
equation (3.2) of that paper from our Theorem 2.3, item 2, because a n = 
i/n, V = C~ 1 AC~ 1 and G can be taken as G = \\ri\\I s (see (6.2)); in the 
case of rotationally symmetric P about the mean direction 9, rj n has a 
rotationally symmetric distribution and T n can be taken as T n = (^/n/^/j3)I s 
(see Corollary 6.2), where [3 is as in [4]. Then the confidence region given in 
Theorem 2.2 conforms with the second confidence region of Ducharme and 
Milasevic [4]. Note that Theorem 2.2 gives the confidence region without any 
rotational symmetry assumption. As for the first confidence region given in 
[4], it has the disadvantage that if 9 belongs to a confidence region, then 
—9 also belongs to the same confidence region, so, in fact, it consists of two 
antipodal confidence regions. It suffers from the problem addressed after 
Remark 2.4. 

Theorem 2.2 immediately extends the results for spheres to Euclidean 
manifolds. Moreover, one can use different generalizations of spatial median 
functionals, as given, for example, in [17] and [3]. The simple converging 
algorithm for the derivation of spatial and related medians is given in [19]. 

Example 7.1. As an illustration of the techniques, we take the sample 
of size n = 14 on the circle from Ducharme and Milasevic [4] and produce 
the ingredients and 95% confidence region without a rotational symmetry 
condition. Then a n = y/n, the empirical median vector fj = (—0.661,0.647) 
and the empirical median location 9 = (—0.715,0.699) (i.e., 135.6°, as in 
loc. cit.). For V, we take its empirical version, 



V = C~ 1 AC~ 1 



0.148 0.201 
0.201 0.379 



for G, we take its empirical version, G = H^HL = 0.925 I s . We take T n = 
{y/n/ \J~f3\)I s , where (3\ is uniquely defined by the condition tang V tang = 
/3itan# (6 denotes the median location of the distribution, for rotationally 
symmetric measures (3± = (3 with f3 as defined in loc. cit.), and use its em- 
pirical form f n = (vW \Jlh)Is, where /?i is defined by tan^ytang = /3itang, 
giving Pi = 0.467. This leads to the confidence region (113.3°, 157.9°), which 
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is slightly wider than (114.3°, 157.2°) found in loc. cit. under rotational sym- 
metry conditions. 

7.2. Multisample setup. Suppose that we are provided with k (k fixed) 
independent samples on the manifold M C R s , 

(7.2) X a ,...,X in ., i = l,...,k. 

The main feature of the multisample setup is the dependence of the under- 
lying distribution P on n. Denote by a, = EXji and Ej = V(Xji) the mean 
expectation point and covariance matrix, respectively, of the iih sample, 
i = 1, . . . , k. Let n = Ya=x n i be the total number of observations and let 

t n = -S2riiai and t n = X n = - Y] Y] 
n f— f n . ' 

i=i i=i j=i 

so that t n is the average of all the observations. Suppose that t n ^ C and Ej 
is positive definite, i = 1, . . . , k. Denote 

'1 k X 



= vr(t„) =7T -^rtj 



This will be considered as the mean location of the multisample data (7.2). 
Furthermore, 

fin = n(X n ) 

is the sample mean location for the multisample data (7.2). Setting 

k \ 1/2 



(l k v 



we can verify that the multivariate version of the Lindeberg condition (see, 
for example, [10]) holds for n — > oo and consequently we have (2.1) with 
standard multivariate Gaussian limit Z. In fact, to apply [10], we reorganize 
Z n in (2.1) as 

4 = 1 J=l 

Then 

V(S n ) = I„ 
where I s denotes the identity matrix. Let 

A = min min \n , 

Ki<k KKs 
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where Xu,.-.,Xi s are the eigenvalues of the positive definite matrices Sj, 
i = 1, . . . , k, so A > 0. Note that 

Bl > n^Ms 

in the sense that B^ — n _1 AI s is nonnegative definite. Thus, 

||-Bn^|| 2 = x T B\x > Arz 1 1 1 a? 1 1 2 , 
WB^xf < A _1 n||a:|| 2 

and 

k 

L n (e) =J2 U * E \\ B n 1 ( Xil ~ a *)/ n H 2l {||B- 1 (X ll -a 1 )/n||>4 
i=l 

1 k 

i=l 

" li™ E|l(Xl1 " ai)l|2l {ll(^^ll>^ £ } ^° as n^oo. 
This establishes the Lindeberg condition. 

7.2.1. Confidence region. To apply Theorem 2.2 in order to clarify the 
asymptotic behavior of {jl n — fi), we should note that now, t n = J2i=i jt a i 
depends on n and may approach the cutlocus C of the manifold. If, however, 
condition (2.6) (for the case of sphere condition (6.1)) holds, then from item 3 
of Theorem 2.2, we have 

(ft'n l^n) G n Y n Y n G n (jl n l^n) * CmJ 

which provides a confidence region for \i n . Let us note that because B n has 
the form 

1/2 



r 1 ff y\ 



where on = ni/n, i = 1, . . . , k, and Ya=i a i = 1) we can use Corollary 2.1 and 
reduce condition (2.6) to 



k 

m 



(7.3) -v/"'^ —ai, C J — > oo as n, m, . .. ,n£ — > oo. 



1 n 

i=i 



As a matter of fact, (7.3) is a restriction on the behavior of nj, i = 1, . . . , k, 
dependent on n in the situation where the cutlocus intersects the convex hull 
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of vectors a±, . . . , a&. For the sphere, one may use Corollary 6.1 and then the 
condition simplifies to 



(7.4) 



u,' LCl 

\ . . n n J 



OO. 



In the following example, we illustrate condition (7.4). 

Example 7.2. Let M = S" 8 " 1 = {ieR s \\x\\ = 1}. Then C ={0} (the 
origin). Let k = 2 and suppose that a\ ^ and "f\a\ + 72O2 = for some 
7i > 0, 72 > 0. Then 



tr 



1 + 



71 

72 



n 



7i 



and t n may approach the cutlocus if ^ 



7i +72 



a 1 



-. Condition (7.4), in fact, 

71+72 V /: ) 

restricts the speed of these convergences, that is, (7.4) reduces to 



ni 
n 



71 



00 



as rai, n — > 00. 



7i +72 

In particular, if 02 = (02 G C), then 71 = and the condition is 

as rii, n^cxD. 



00 



7.2.2. Hypothesis testing. Suppose aj ^ C and let z/, = vr(aj), i = 1, . . . , k, 
be the mean locations on the manifold for each sample, where we suppose 
that v\ = • • • = Uk = fj,\. Suppose the null hypothesis 

(7.5) H :m=fi 

holds. Then from Lemma 3.3, we have 

k 



7T 



nidi 



/'• 



Moreover, this lemma says that the convex hull of a±,...,ak never inter- 
sects the cutlocus. This means that in spite of the underlying distributions 
depending on n, condition (7.3) holds automatically and from item 2 of 
Theorem 2.2, we have 



(7.6) T n G n (fl n - n) = (r n tan M £ n )Z n + o P (l 
while from item 3 of Theorem 2.2 we have 



v 



Af(0,tan M ), 



(An ~~ t l Y G n T n T n G n (fl n — fl) — ► Xrm 
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which provides a test for Ho. 

We now address the two-sample problem. Let 

Xa,...,X ini , i = l,...,A;i, and Yj X , ■ ■ ■ ,Y jtj , j = 1, . . . , h 



<-3 ' 

be two multisample sets of data on the manifold M having equal mean 
locations within each set, that is, 

v\ = ■■■ = v kl =/Xl, 

V\ = ■ ■ ■ = V k2 = /X 2 . 

Denote by a, = EXa and Ej = V(Xji) [resp. bj = EYji and Ej = V(Yji)] 
the expectation vector and covariance matrix of the ith sample, i = 1, . . . , k±, 
of X-data (resp. the jth sample, j = 1, . . . , of F-data). Let re = J2i=i n ii 
l = Y!tlih and 

^ fci ni k 2 ii 

i=l j=l i=l j—1 

be numbers and averages of all X-observations and all y-observations, re- 
spectively. Then 

(l kl \ (l k2 \ 

and 

v n = ir(X n ), vi = ir(Y e ) 

are mean and sample mean locations, respectively, for multisample data X 
and Y. Let us show how Theorem 2.2 provides a test for the hypothesis 
Hq ■ Mi = M2- Denote 

^ fci ^ fc 2 



U i=l ^ i=l 

*n = X n , tt£ = l^ 



and 



/ j *1 \ V2 ( I ki \ 1/2 



t=l/ \ i=l / 

Then the multivariate Lindeberg condition holds if re, I — > oo and we have 



Z 1>n = Baffin - t n ) -> Zi and 
Z 2/ = B~l(u e - m) ^> Z 2 , 
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where Z\ and Z2 are two independent standard s-dimensional normal dis- 
tributions, jV(0,I s ). Let Gi )U and G2/ (also Yi^ n and be matrices cor- 
responding to X-data and F-data and satisfying (2.5), (2.6) and (2.7). We 
suppose that Ti in and T2/ are chosen to be nonsingular; Gi >n and G<i,i are 
nonsingular by definition. 

Suppose the null hypothesis Hq : fJ,i = fJ,2 holds. Then we have 

v\ = • • ■ = Vki = vi = ■ ■ ■ = v k2 = pL. 
From item 1 of Theorem 2.2, we have (cf. (7.6)) 

(7.7) ri jn Gi jn (z> n - n) - (r lin tan /i Bi >n )Z lin ^> 0, 

(7.8) T 2 /G 2) e(vi - (j,)- (r2^tan /x J3 2 ^)2 , 2 I i ^ 0. 
Denote 

At = {r 1>n G ltn )-\ A2 = (T 2/ G 2/ r\ C = A 1 Aj + A 2 Al 

The matrix C is positive definite and it follows immediately from the defi- 
nition of C that the linear transformations C _1 / 2 A/, j = 1,2, are uniformly 
bounded in n and £, respectively. Therefore, from (7.7) and (7.8), we obtain, 
as n, I — > 00, 

C- X l 2 {v n - v e ) - C-^iA^tan^nZ^ - A 2 T2, e tan fl B 2/ Z2,e) ^ 0. 

As Z\ and Z2 are independent standard s-dimensional normal distributions, 
AA(0,I s ), one can straightforwardly obtain that 

(7.9) C-V 2 A 1 F lin ta.n li B l7n Z 1 - CT^^r^an^,^ = N(0, V), 
where, taking into account (2.7), 

V = C-^Viri.ntan^^B^tan^rX 

+ ^ara^tan^^^^tan^r^^C" 1 / 2 

= C" 1 / 2 [A 1 tan At ^ + A 2 tan fl A^}C^ 2 . 

Choosing ri jn , T2J to commute with tan^ (see Remark 2.4), we have ^jtan^ = 
t&n^Ai, i = 1,2, C _1 / 2 tan M = tan^C -1 / 2 and hence V = tan^. As the coef- 
ficients of Z\ and Z2 in (7.9) are uniformly bounded in norm [by the above 

and (2.7)], from Lemma 5.2 it follows that C~ l l 2 {i> n — vi) ^> N(0,tan^) and 
consequently that 

(7.10) 

+ G^ffi^r 1 ^]-^ - v t ) ^ xl- 
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To obtain a real test, one should substitute l\ n ,r2/ and G\^ n ,G%^ in (7.10) 
with their empirical analogues as follows (one can find more details in [7]): 

V2 / 1 fa \ V2 



/ 1 kl \ 1 _ / 1 2 



/l ~ ~ T \-V2 

ri, n = I -nor^ n + tan^Si^^^tan^ J , 

IV = 7nor^ +tan^S 2 ^5 2£ tan^ 



Ci^ — Is - ^x n -p n tan ^) — Is - -A Y/ _ 0< tan^, 

where Ej, E n i = 1, . . . , fci, r = 1, . . . , &2, are the sample covariance matri- 
ces of the subsamples of A"-data and K-data, respectively. Note that the 
asymptotic equation 

provides an asymptotic test for Hq without any knowledge about the value 
of the common mean location fi. 

7.3. Spherically symmetric stable limit distribution. Suppose, as in Sec- 
tion 2.2, that the underlying probability measure P n = P does not depend on 
n and that the functional T n = T does not depend on n. Suppose that P is 
a spherical probability distribution on the whole space R s (see Remark 2.1) 
and that the radial distribution has a regularly decreasing tail. Consider, for 
example, for some 5 > 0, C > and a G (0, 2), a sample X±, . . . , X n from the 
spherical distribution P, 

P{x£~R s : \\x- a|| >r} = Cr~ a , r>5, 
P{x E R s : jjx — a\\ > r} = 1, r < 5. 

Then (see [18], Section 7.5) limit condition (2.10) holds with t = a, t n = X n , 
a n = n 1 " 1 /" and V = i( Cr ^ff+^7 2 ° /2) ) 2/a Is, and the limit distribution Z 
has the characteristic function fz(t) = exp(— ||t|| a ) (t G R s ), that is, Z has 
a spherically symmetric stable distribution (see also [5], Section 3.5). The- 
orem 2.3 holds and asymptotic confidence regions are obtained, where (J^ 
(which is not the classical Xm distribution) has a distribution that does not 
depend on fi (see Remark 2.3). Moreover, the distribution of (Zi,...,Z m ) 

has characteristic function exp(— \\t\\ a ) (t £ R m ) and £ m = Y^i=\Zf . Nolan 
[16] gives several representations for the density of Cm = \/Cm"- O ne °f them, 
based on [21], equation (6), yields an expression for the density of £ m , 

1 f°° 

9(l( s2 ) = 2 m/2 r (m) s J Q (su) m/2 J m /2-i(su)exp(-u a )alu, 
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which can be tabulated (J p is the Bessel function of order p). In case a = 1, 
Z is just a multivariate Cauchy distribution; explicit analytic expressions 
can be found in [16]. 
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